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4 <110> APPLICANT: Genencor International, Inc. 

5 Jones, Brian E. 

6 Grant, William D. 

7 Heaphy, Shaun 

8 Grant, Susan 

9 Rees, Helen 

11 <120> TITLE OF INVENTION: Novel Bacillus mHKcel Cellulase 

14 <130> FILE REFERENCE: GC797-2-PCT 
C--> 16 <140> CURRENT APPLICATION NUMBER: US/10/549 , 603 
C--> I? <141> CURRENT FILING DATE: 2005-09-20 

19 <T50>" PRIOR APPLICATION NUMBER: US 60/467,315 *' " 

20 <151> PRIOR FILING DATE: 2003-04-30 
22 <160> NUMBER OF SEQ ID NOS : 3 

24 <170> SOFTWARE: FastSEQ for Windows Version 4.0 

26 <210> SEQ ID NO: 1 

27 <211> LENGTH: 3796 

28 <212> TYPE: DNA 

29 <213> ORGANISM: Bacillus sp. 

31 <220> FEATURE: 

32 <221> NAME/KEY: misc_f eature 

33 <222> LOCATION: (1) . . . (3796) 

34 <223> OTHER INFORMATION: isolated from environmental water sample from 

35 Sonachi Lake, Kenya 

37 <221> NAME /KEY : misc_feature 

38 <222> LOCATION: (1) . . . (3796) 

39 <223> OTHER INFORMATION: n = A,T,C or G 
W--> 41 <400> 1 

42 atctttagag atgagcgtga tagtcagtgg accttaaaag aacttaaaag atttttagaa 60 

43 gagattaaaa cggacccgca ccatctctct gtttattttg atgggggatt tgatttggag 120 

44 acacaacgat ctggccttgg gtgtgtgatt tattatgaac aaaatgacac gtcttaccgg 180 

45 gtgagaaaaa acgctaccgt ggcgtcattg acatcgaata accaagcaga atatgccgct 240 

46 ttgcatttag gacttaaaag aacttgaagg gatcgggcgc atcatctgcc tatcaccatt 3 00 

47 tacggtgatt ctcaagttgt gatcaatcag ttaaaaggag aatgggccgt gtatggagga 360 

48 ggtgttaaat aaatgggctg acccgtattg atcaagcatt tagctaaatt aggcatgacc 420 

49 gctacttata agttaatccc ccgtaaagaa aaccgcgaag ccgatcaact ggctacacaa 480 

50 gcgttaaacg ggcaagaaat tataagtcaa cgtgatatca gtgagcgtgg tgcagattag 540 

51 gctgcaccgc gcaaaaaaag tcaacgtgtt taggaatgga cagggattaa agcaacataa 600 

52 ttctctctaa gcaaacgttg cgacagcaag agagaagcat ataaggtttt tctgagttag 660 

53 tctatttata ccaatgtcca cgtactaaat aaacctctca tcaaagtgga ttttttgatt 720 

54 aattcacttc cactcctacc tttatctata taaattagtt ccttttttgt taataatcac 780 

55 taattttggc ggtatttttt aatagaaata tatgctagat tataaactag tagccgtata 840 

56 gaaggtggtg attgccccta taagagacgt ctggcaaaca taaaagcatc gcattattat 900 

57 aatcgaaagg tggagatgag acatgggtta tacccaagct aagtgtatgg tgaaaaaaac 960 
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58 ggtcttgttt ggtttaattc tctgtttagg tgtgtcaatg tttgtaccag ttacatcagc 1020 

59 tgaagatagg gtctcttcgt cacaggtgga tatccaatca tatgtagcag atatgcaacc 1080 

60 tggctggaat ttaggtaata catttgatgc gataggagat gatgaaacag catggggaaa 1140 

61 ccctcgtgta acgagagaat taatagaaat gattgctgat gaagggtata aaagtattcg 1200 

62 tatcccagtc acatggcaaa atcaaatggg tggttctcca gattatacaa ttaatgaaga 1260 

63 ttatatcaag cgggtagagc aagtgataga ttgggcgttg gaggaagact tgtatgtgat 1320 

64 gttaaatgtg catcatcjact catggctgtg gatgtatgat atggaacata actatgatga .0 380 

65 ggtgatggca agatatacag ctatttggga acaattgtcg gaaaaattca aaaaccactc 1440 * 

66 ccataagttg atgtttgaga gtgtcaatga gcctaggttt acgcaggagt ggggagagat 1500 

67 tcaagaaaat catcatgctt acttagaaga tttaaataag acgttctatt atattgtcag 1560 

68 agagtcagga ggcaataatg tggagcgccc tttagtattg cctacgatag aaacagccac 1620 

69 gtctcaggat ttactagatc gcttgtatca aacaatggaa gacttggatg accctcattt 1680 

70 aattgccacg gttcattatt atggcttttg gccctttagt gtcaatatag cagggtacac 1740 

71 ccgttttgaa caggagacac aacaagatat tatagacacg tttgaccgtg ttcataacac 1800 

72 atttacagcg aatgggatcc cagttgtatt aggtgaattt ggtttgttag gctttgataa 1860 

73 aagtacggac gtcattcagc aaggtgagaa attaaaattt tttgagtttc tcatccatca 1920 

74 tctcaatgaa cgtgatataa cccatatgtt atgggataac ggtcagcatt taaagcgaga 1980 

75 aacttattca tggtatgatc aggaatttca tgacatatta aaagcgagtt gggaggggcg 2 040 

76 ttctgctaca g.ctgagtcta atttcattca tgtgaaggac. ggagagccaa ttagagatca 2.100 - 

77 acatatacag ctttacttaa' acggaaatga gctaactgcc ctacaggcag gggacgaatc 2160 

78 gcttgtacta ggagaggatt atgagctagc aggagacgta ttaacgctaa aagcgggcat 2220 

79 cctcacaaga ttaattaccc ctggccaatt aggaacgaat gcggtcatca cagctcaatt 2280 

80 taattctgga gcagactggc gttttcaatt acagaatgtg gacgtgccaa cagtcgaaaa 2340 

81 tacagatggc tcaatatggc attttgcgat ccctacccat tttaatggtg atagtcttgc 2400 

82 gacgatggaa gctgtttatg caaacggaga atatgctggc ccgcaagatt ggacgtcatt 2460 

83 taaagaattt ggcgaggcgt tttcccctaa ttacgccaca ggggaaatta ttataacaga 2520 

84 agccttcttt aacgcggtac gggatgatga tatccattta acatttcatt attggagcgg 2580 

85 agagacggtg gaatatacat tacgtaaaaa tggaaattat gttcaaggta gacggtaaca 2640 

86 tgattttaat taatagataa aacagcctac ctatcgtttt tggaagaagg caaacgaatc 2700 

87 tcatcttacc aacaccgtgc tttagaactt tagaagtgac ggtgtttttt aagacatgag 2760 

88 gagagacaat cctctatcaa cagtcaccaa tttttattca ggaggtgtca agttatctaa 2820 

89 cgttctatga atgcatatag tttctgacga ataaacatag ttaaaaagaa gtgagcctag 2 880 

90 ttcccgaggg gaaggggata atgccaacgt attggattaa agtaccttct tgataaaaag 2 940 

91 aaagggtttt caagaggtgg aaatgggctc gtttgttata ctttaattac accttggaac 3 000 

92 gtcattttgg cggtgctact tagtaagatg actgacatca taaaagagga gtgggttcga 3 060 

93 tggctttaat tcaattaagc tttaaatcac gagcattaat gttgcaaacc tctgtcaatg 3120 

94 ttttattacc ggtgggaatg aatgcggtag attttacacc aagtgatgat ttttcttatg 3180 

95 ttactgaccc ttttcctgtc ctatatcttt tgcatggtgc aactgatgat tattcagcat 3240 

96 ggctacgtct gtcctccatt gaacgatatg ctgaagaaaa aaaattggcg gtcgtcatgc 3300 

97 caaatgctga tatgagtgcg tatacggata tggtacatgg acatcgttac tggacgtata 3360 

98 ttagtaagga gctgcctgag tttatcaaag cgacttttcc tatttctcag caccgtgaag 3420 

99 acacctttgc ggctggtctg tctatgggag gatacggggc ttttaaatta gcgttgcggc 3480 

100 aaccggaacg cttcgctgca gntgtgtcat tatcaggtgc agttgatatg agagaagcaa 3540 

101 gtcaaccaga ctccctattt gtgaacgcat ttggtgaagg gacgaaaatc gcagggacag 3600 

102 atcttgatct ttttcattta attaaaaagt tgggggtata tgaaggggct aaaccagccc 3660 

103 tttttcaagc gtgtgggaca gaggactttt tatatgaaga taatgtgaga tttagagatt 3 720 

104 atgcacgaca agtgaatgcc gatttaactt atgaagaagg tcctggtggt catgaatggg 3780 

105 cttattggga tagaat 3796 
107 <210> SEQ ID NO: 2 
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108 <211> LENGTH: 1716 

109 <212> TYPE: DNA 

110 <213> ORGANISM: Bacillus sp. 

112 <220> FEATURE: 

113 <221> NAME /KEY : misc_feature 

114 <222> LOCATION: (1) . . . (1716) 

115 <223> OTHER INFORMATION: isolated from environmental water sample 
116" Sonachi Lake, Kenya ' ~ " - 

118 <400> SEQUENCE: 2 

119 atgggttata cccaagctaa gtgtatggtg aaaaaaacgg tcttgtttgg tttaattctc 

120 tgtttaggtg tgtcaatgtt tgtaccagtt acatcagctg aagatagggt ctcttcgtca 

121 caggtggata tccaatcata tgtagcagat atgcaacctg gctggaattt aggtaataca 

122 tttgatgcga taggagatga tgaaacagca tggggaaacc ctcgtgtaac gagagaatta 

123 atagaaatga ttgctgatga agggtataaa agtattcgta tcccagtcac atggcaaaat 

124 caaatgggtg gttctccaga ttatacaatt aatgaagatt atatcaagcg ggtagagcaa 

125 gtgatagatt gggcgttgga ggaagacttg tatgtgatgt taaatgtgca tcatgactca 

126 tggctgtgga tgtatgatat ggaacataac tatgatgagg tgatggcaag atatacagct 

127 atttgggaac aattgtcgga aaaattcaaa aaccactccc ataagttgat gtttgagagt 

128 gtcaatgagc ctaggtttac gcaggagtgg ggagagattc aarraaaatca tcatgcttac 

129 ttagaagatt taaataagac gttctattat attgtcagag agtcagg'agg caataatgtg 

130 gagcgccctt tagtattgcc tacgatagaa acagccacgt ctcaggattt actagatcgc 

131 ttgtatcaaa caatggaaga cttggatgac cctcatttaa ttgccacggt tcattattat 

132 ggcttttggc cctttagtgt caatatagca gggtacaccc gttttgaaca ggagacacaa 

133 caagatatta tagacacgtt tgaccgtgtt cataacacat ttacagcgaa tgggatccca 

134 gttgtattag gtgaatttgg tttgttaggc tttgataaaa gtacggacgt cattcagcaa 

135 ggtgagaaat taaaattttt tgagtttctc atccatcatc tcaatgaacg tgatataacc 

136 catatgttat gggataacgg tcagcattta aagcgagaaa cttattcatg gtatgatcag 

137 gaatttcatg acatattaaa agcgagttgg gaggggcgtt ctgctacagc tgagtctaat 
13 8 ttcattcatg tgaaggacgg agagccaatt agagatcaac atatacagct ttacttaaac 
13 9 ggaaatgagc taactgccct acaggcaggg gacgaatcgc ttgtactagg agaggattat 

140 gagctagcag gagacgtatt aacgctaaaa gcgggcatcc tcacaagatt aattacccct 

141 ggccaattag gaacgaatgc ggtcatcaca gctcaattta attctggagc agactggcgt 

142 tttcaattac agaatgtgga cgtgccaaca gtcgaaaata cagatggctc aatatggcat 

143 tttgcgatcc ctacccattt taatggtgat agtcttgcga cgatggaagc tgtttatgca 

144 aacggagaat atgctggccc gcaagattgg acgtcattta aagaatttgg cgaggcgttt 

145 tcccctaatt acgccacagg ggaaattatt ataacagaag ccttctttaa cgcggtacgg 

146 gatgatgata tccatttaac atttcattat tggagcggag agacggtgga atatacatta 

147 cgtaaaaatg gaaattatgt tcaaggtaga cggtaa 

149 <210> SEQ ID NO: 3 

150 <211> LENGTH: 571 

151 <212> TYPE: PRT 

152 <213> ORGANISM: Bacillus sp. 

154 <220> FEATURE: 

155 <221> NAME/KEY: VARIANT 

156 <222> LOCATION: (1) . . . (571) 

157 <223> OTHER INFORMATION: isolated from environmental water sample 

158 Sonachi Lake, Kenya 

160 <400> SEQUENCE: 3 

161 Met Gly Tyr Thr Gin Ala Lys Cys Met Val Lys Lys Thr Val Leu Phe 



60 
120 
180 
240 
300 
360 
420 
480 
540 
60C 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1716 



file : //C : \CRF4\Outhold\VsrJ549603 . htm 



12/1/2006 



Page 4 of 8 



RAW SEQUENCE LISTING DATE: 12/01/2006 

PATENT APPLICATION: US/10/549 , 603 TIME: 14:19:56 

Input Set : A:\797_2_PCT_SeqList.TXT 
Output Set: N:\CRF4\12012006\J549603.raw 



162 


1 






5 










10 










15 




163 


Gly Leu 


He 


Leu 


Cys 


Leu 


Gly 


Val 


Ser 


Met 


Phe 


Val 


Pro 


Val 


Thr 


Ser 


164 






20 










25 










30 






165 Ala Glu Asp 


Arg Val 


Ser 


Ser 


Ser 


Gin 


Val 


Asp 


He 


Gin 


Ser 


Tyr 


Val 


166 




35 










40 










45 








167 Ala Asp Met 


Gin Pro Gly 


Trp 


Asn 


Leu 


Gly 


Asn 


Thr 


Phe 


Asp 


Ala 


He 


168 


50 










55 










60 










169 Gly Asp Asp 


Glu 


Thr 


Ala 


Trp 


Gly 


Asn 


Pro 


Arg 


Val 


Thr 


Arg 


Glu 


Leu 


170 


65 








70 










75 










80 


171 


He Glu 


Met 


He Ala Asp 


Glu 


Gly 


Tyr 


Lys 

2 


Ser 


He 


Arq 


He 


Pro 


Val 


172 








85 










90 










95 




173 


Thr Trp 


Gin 


Asn 


Gin 


Met 


Gly 


Gly 


Ser 


Pro 


Asp 


Tyr 


Thr 


He 


Asn 


Glu 


174 






100 










105 










110 






175 Asp Tyr 


He 


Lys Arg 


Val 


Glu 


Gin 


Val 


He 


Asp 


Trp 


Ala 


Leu 


Glu 


Glu 


176 




115 










120 










125 








177 


Asp Leu 


Tyr 


Val 


Met 


Leu 


Asn 


Val 


His 


His 


Asp 


Ser 


Trp 


Leu 


Trp 

tr 


Met 


178 


130 










135 










140 










179 


Tyr Asp Met 


Glu 


His 


Asn 


Tyr 


Asp 


Glu 


Val 


Met 


Ala 


Arq 


Tvr 


Thr 


Ala 


180 


145 








150 










155 










1.60 : 


181 


He Trp 


Glu 


Gin 


Leu 


Ser 


Glu 


Lys 


Phe 


Lys 


Asn 


His 


Ser 


His 


Lys 


Leu 


182 








165 










170 










175 




183 


Met Phe 


Glu 


Ser 


Val 


Asn 


Glu 


Pro 


Arq 


Phe 


Thr 


Gin 


Glu 


Trp 


Gly 


Glu 


184 






180 










185 










190 






185 


He Gin 


Glu 


Asn 


His 


His 


Ala 


Tyr 


Leu 


Glu 


Asp 


Leu 


Asn 


Lys 


Thr 


Phe 


186 




195 










200 










205 








187 


Tyr Tyr 


He 


Val 


Arg 


Glu 


Ser 


Gly 


Gly 


Asn 


Asn 


Val 


Glu 


Arq 


Pro 


Leu 


188 


210 










215 










220 










189 


Val Leu 


Pro 


Thr 


He 


Glu 


Thr 


Ala 


Thr 


Ser 


Gin 


Asp 


Leu 


Leu 


Asp 


Arg 


190 


225 








230 










235 










240 


191 


Leu Tyr 


Gin 


Thr 


Met 


Glu 


Asp 


Leu 


Asp 


Asp 


Pro 


His 


Leu 


lie 


Ala 


Thr 


192 








245 










250 










255 




193 


Val His 


Tyr 


Tyr Gly Phe 


Trp 


Pro 


Phe 


Ser 


Val 


Asn 


He 


Ala 


Gly 


Tyr 


194 






260 










265 










270 






195 


Thr Arg 


Phe 


Glu 


Gin 


Glu 


Thr 


Gin 


Gin 


Asp 


lie 


He 


Asp 


Thr 


Phe 


Asp 


196 




275 










280 










285 








197 Arg Val 


His 


Asn 


Thr 


Phe 


Thr 


Ala 


Asn 


Gly 


lie 


Pro 


Val 


Val 


Leu 


Gly 


198 


290 










295 










300 










199 


Glu Phe 


Gly 


Leu Leu Gly 


Phe 


Asp 


Lys 


Ser 


Thr 


Asp 


Val 


He 


Gin 


Gin 


200 


305 








310 










315 










320 


201 Gly Glu Lys 


Leu 


Lys 


Phe 


Phe 


Glu 


Phe 


Leu 


lie 


His 


His 


Leu 


Asn 


Glu 


202 








325 










330 










335 




203 


Arg Asp 


He 


Thr 


His 


Met 


Leu 


Trp 


Asp 


Asn 


Gly 


Gin 


His 


Leu 


Lys 


Arg 


204 






340 










345 










350 






205 


Glu Thr 


Tyr 


Ser 


Trp 


Tyr 


Asp 


Gin 


Glu 


Phe 


His 


Asp 


He 


Leu 


Lys 


Ala 


206 




355 










360 










365 








207 


Ser Trp 


Glu 


Gly Arg 


Ser 


Ala 


Thr 


Ala 


Glu 


Ser 


Asn 


Phe 


He 


His 


Val 


208 


370 










375 










380 










209 


Lys Asp Gly 


Glu 


Pro 


lie 


Arg 


Asp 


Gin 


His 


He 


Gin 


Leu 


Tyr 


Leu 


Asn 


210 


385 








390 










395 










400 



f ile : //C : \CRF4\Outhold\VsrJ549603 . htm 



12/1/2006 



Page 5 of 8 



RAW SEQUENCE LISTING DATE: 12/01/2006 

PATENT APPLICATION: US/10/549,603 TIME: 14:19:56 



Input Set : A:\797_2_PCT_SeqList.TXT 
Output Set: N:\CRF4\12012006\J549603.raw 



211 


Gly Asn Glu Leu 


Thr 


Ala 


Leu 


Gin 


Ala 


Gly 


Asp 


Glu Ser 


Leu 


Val 


Leu 


212 








405 










410 








415 




213 


Gly Glu Asp Tyr 


Glu 


Leu 


Ala 


Gly 


Asp 


Val 


Leu 


Thr Leu 


Lys Ala Gly 


214 






420 










425 








430 






215 


He 


Leu 


Thr Arg 


Leu 


He 


Thr 


Pro 


Gly 


Gin 


Leu 


Gly Thr 


Asn 


Ala 


Val 


216 






435 








440 








445 








217 


He 


Thr 


Ala Gin 


Phe 


Asn 


Ser 


Gly 


Ala 


A^p 


Trp 


Arg Phe 


Gin 


Leu 


Gin 


218 




450 








455 










460 








219 


Asn 


Val 


Asp Val 


Pro 


Thr 


Val 


Glu 


Asn 


Thr 


Asp 


Gly Ser 


He 


Trp 


His 


220 


465 








470 










475 








480 


221 


Phe 


Ala 


He Pro 


Thr 


His 


Phe 


Asn 


Gly 


Asp 


Ser 


Leu Ala 


Thr 


Met 


Glu 


222 








485 










490 








495 




223 


Ala 


Val 


Tyr Ala 


Asn 


Gly 


Glu 


Tyr 


Ala 


Gly 


Pro 


Gin Asp 


Trp 


Thr 


Ser 


224 






500 










505 








510 






225 


Phe 


Lys 


Glu Phe 


Gly 


Glu 


Ala 


Phe 


Ser 


Pro 


Asn 


Tyr Ala 


Thr Gly Glu 


226 






515 








520 








525 








227 


He 


He 


He Thr 


Glu 


Ala 


Phe 


Phe 


Asn 


Ala 


Val 


Arg Asp 


Asp 


Asp 


He 


228 




530 








535 










540 








229 


His 


Leu 


Thr Phe 


His 


Tyr 


Trp 


Ser 


Gly 


Glu 


Thr 


Val Glu 


Tyr 


Thr 


Leu 


230 


545 








550 










-555- 








560 


231 


Arg Lys Asn Gly 


Asn 


Tyr 


Val 


Gin 


Gly 


Arg 


Arg 










232 








565 










570 













file ://C:\CRF4\Outhold\VsrJ549603 .htm 



12/1/2006 



Page 6 of 8 



RAW SEQUENCE LISTING ERROR SUMMARY 

PATENT APPLICATION: US/10/549 , 603 



DATE: 12/01/2006 
TIME: 14:19:57 



Input Set : A:\7 97_2_PCT_SeqList.TXT 
Output Set: N:\CRF4\12012006\J549603.raw 



Please Note: 
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